On Reconfiguration Latency in Fault-Tolerant Systems

Authors

  • Hagbae Kim
  • Kang G. Shin
  • Chuck Roark
Abstract

Digital computers embedded in critical applications such as flight controls should be equipped with appropriate fault-tolerance schemes to ensure their reliable and safe operation in the presence of component failures. System reconfiguration, which enhances reliability by dynamically using spatial redundancy, is generally the most time-consuming fault/error-handling stage. The reconfiguration latency, defined as the time taken to reconfigure a system upon failure detection or mode change, depends on many parameters, including the size of application programs and data, the CPU and memory speed, built-in testing capabilities, the type of spares used (cold, warm, or hot), the system architecture, and the reconfiguration strategy used. In this paper, we classify the reconfiguration techniques into four types: reconfigurable duplication, reconfigurable N-Modular Redundancy (NMR), backup sparing, and graceful degradation. For each type of reconfiguration, we (i) evaluate the reconfiguration latency in terms of several parameters that account for the factors listed above, and (ii) determine whether this type of reconfiguration can meet the application-required latency.

Index Terms: Reconfiguration latency, dynamic redundancy, processor and task parameters, backup sparing, graceful degradation, cold, warm, and hot spares

The work reported was supported in part by a Texas Instruments Grant, by the Office of Naval Research under Grant N00014-91-J-1115, and by NASA under Grant NAG-1-1120. Any opinions, findings, and conclusions or recommendations expressed in this paper are those of the authors and do not necessarily reflect the view of the funding agencies.

0-7803-24734/95/$4.00 © 1995 IEEE


Similar resources

A New Design of Fault Tolerant Comparator

In this paper we present a new design of a fault-tolerant comparator with a fault-free hot spare. The aim of this design is to achieve a low time and area overhead in fault-tolerant comparators. We use the hot-standby technique to keep the system operating normally without interruption, and a dynamic recovery method for fault detection and correction. The circuit is divided into smaller modules...

A fault tolerant NoC architecture using quad-spare mesh topology and dynamic reconfiguration

Network-on-Chip (NoC) is widely used as a communication scheme in modern many-core systems. To guarantee the reliability of communication, effective fault tolerant techniques are critical for an NoC. In this paper, a novel fault tolerant architecture employing redundant routers is proposed to maintain the functionality of a network in the presence of failures. This architecture consists of a me...

Towards a Strongly Fault Tolerant VLSI Processor Array

In this paper, a novel methodology to achieve fault tolerance in VLSI array processors is proposed. A "fence"-based approach is adopted in which the logic array is partitioned and spares are distributed along the boundary of the active array. The emulator, as in conventional fault-tolerance techniques, takes care of fault mapping and reconfiguration. The latency, reconfiguration interconnect lengt...

Control Reconfiguration: Survey of Methods and Open Problems

This report surveys recent results and approaches from the field of control reconfiguration within a general fault-tolerant control framework. Open problems for future research are identified. Control reconfiguration is embedded within a wider fault-tolerant control context and set apart from other approaches to achieve fault tolerance. It is explained why simpler approaches than control reconf...

Fault-Tolerant Systolic Array Design With Partially-Reconfigurable FPGAs

Partial dynamic reconfiguration (PDR) is an important feature of modern reconfigurable architectures such as the Xilinx Virtex FPGA devices. In this paper we focus on the application of PDR to fault-tolerant systolic arrays in one dimension. Single as well as multiple faults in one or more array processing elements (PEs) are considered. In our approach modular redundancy is not used and the goa...



Publication date: 1995